CDS

Accession Number TCMCG078C26484
gbkey CDS
Protein Id KAG0497245.1
Location complement(join(32706425..32706447,32706541..32706624,32706773..32706858,32707068..32707188,32707268..32707313,32709859..32709936,32710825..32710931,32711339..32711399,32711933..32711992,32712376..32712458,32715911..32716021,32716648..32716753,32728974..32729025,32729092..32729377,32734216..32734467,32734520..32734545,32742228..32742490,32742519..32742611))
Organism Vanilla planifolia
locus_tag HPP92_001936

Protein

Length 645aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000001.1
Definition hypothetical protein HPP92_001936 [Vanilla planifolia]
Locus_tag HPP92_001936

EGGNOG-MAPPER Annotation

COG_category Q
Description Indigoidine synthase A like protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01055        [VIEW IN KEGG]
KEGG_rclass RC00432        [VIEW IN KEGG]
RC00433        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K16329        [VIEW IN KEGG]
EC 4.2.1.70        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00240        [VIEW IN KEGG]
map00240        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGTCGTTCCGTTCGGAAAGGGCTCGGAATCCCTCGAGACCACGCCGTACCTAGCGTTCTCCGCACTGCTCGACTTCAAATCCACGCAGACAAGGCCAATTGACCGCGGCTTCCTTGAGTGTTTTGATATCGCAGAGGCGAAGGAGGAACGGACAATTTTAGCCGGCTTTGTGGCGGCGATGGGATGGGTTGTGAGGCAGCGGAGGCCGGCTGGGACCCGACCTCACCAAGGCGAACGCTATGGCGCGCTGCCGCTTGAGCCGGAGCAGGCCATGAGGATGTCACGTCCCAACGTTGCAGGGAGAAGAGGCGCGCCGTTCTTCGGCGGGTTGGATAGGCTGAGGCCTCCCATGGGGAGTCCTGTTTCTGGAATCATCATCCATCTTGCCCGTTTCGTCCTCCTTCGTCTGCTTATCTTCGACGCGACCGACTGGTACGCCACTGACGACGTCTCGGAGGATCAGGAGCCTCCTGGGTCTTCGTTAGACCTCTGGTCCTCGCAACCAATCTCCGAGTTTTTGTGGAATGCGCAACAGAGCATTTCTGTCGCGAAGAAGCTTGAAGATTATGATCCTTGTCTCACCGCGGTGGCTTTGCCTTCCAACGCCCCCGCCATCGCTGTGAGAAGGAGAGTTTGTGTTGGGTTTGACCGATTCTCTGGAACAAAAATGAGTCGAAACCTGAAGGAGAGAGCCTTGGTACAAAGCAGGTCACCCCTGCAGACCTTAGTCTTCCCACGCTGCTTCCCTTCGCCTCAGGCGCTCCCCTGCGAGCCGCTAGTCTGCAGCGACCACTATCACCTCCCGTTAGAGCCTTCACTCTCGCACACCAATCCAACCGTCGCCTTGAACACACCATCTCCGATCGCCCCAGCATACCGTCACACGTGTCTCGGCGTCTCCGTCTGTCTTCAGAGTCTGCCTTCCACAGCAGCATACCTTAATCCTCTTACCCCGGAGATCGACCGCCAGGATGACAATATGCACTTAGGACTTCTAAAAGTAGCACCGATCATATATCATGCGCTTAAACATGGTAATGCAGTTGTTGCTTTGGAGTCGACAATAATTTCTCATGGCATGCCATTTCCCCAAAATCTAAAGACTGCTAAAGAAGTTGAAGCTATTGTAAAGCAGAACGGGGCTATTCCTGCAACAATTGCAATACTTGATGGTATTCCATGCATTGGGCTGGATGATGAACAACTAGAAAAGCTTGCAAAACTTGGACCTAATGCTCAGAAGACATCCCGTAGAGACATTCCTCATGTTATCGCAGCTTGTCAAAATGGTGCAACAACTGTATCTGCTACCATGTTTTTTGCCTCGAAGCTTGGCATACATGTTTTTGTCACCGGTGGTATTGGTGGAGTGCACAGGCATGGTGAGACAACAATGGATATATCATCTGATCTCACTGAACTTGGTAGAACTCCTGTGGCAGTTATTTCTGCTGGTGTTAAGTCTATTTTAGACATTCCCAAGACTCTAGAATATTTGGAAACCCAAGGTGTTACAGTCGCTGCTTACAGGACCAATAGTTTTCCAGCATTTTTCACCAATTCTAGTGGATGCCAGGTCCCTTGTCGCCTTGATACTCCTGAGGAATGTGCAAGGCTTATAAATTCAAACTTGAAGTTAGGGCTTGGGAGTGGAATCCTTATTGCTGTGCCCATTCCAAAACAACATTCAGCCTCTGGAAATCTCATCGAATCTGCAATACAAGAAGCCCTAAAAGAAGCAAAGGATAAACATATAACTGGGAGTGCCGCTACTCCTTTCTTGCTTTCAAGAGTAAATGAGCTAACAGGAGGAGCATCATTAACTGCCAATATTGCCCTTGTCAAGAACAATGCCATGGTTGGTGCTAAAATTGCTGTCGCTCTTGCAAATTTAGGACAAGGCATAAAGAATAGTCATGTTAAGTCGGCACTTTGA
Protein:  
MVVPFGKGSESLETTPYLAFSALLDFKSTQTRPIDRGFLECFDIAEAKEERTILAGFVAAMGWVVRQRRPAGTRPHQGERYGALPLEPEQAMRMSRPNVAGRRGAPFFGGLDRLRPPMGSPVSGIIIHLARFVLLRLLIFDATDWYATDDVSEDQEPPGSSLDLWSSQPISEFLWNAQQSISVAKKLEDYDPCLTAVALPSNAPAIAVRRRVCVGFDRFSGTKMSRNLKERALVQSRSPLQTLVFPRCFPSPQALPCEPLVCSDHYHLPLEPSLSHTNPTVALNTPSPIAPAYRHTCLGVSVCLQSLPSTAAYLNPLTPEIDRQDDNMHLGLLKVAPIIYHALKHGNAVVALESTIISHGMPFPQNLKTAKEVEAIVKQNGAIPATIAILDGIPCIGLDDEQLEKLAKLGPNAQKTSRRDIPHVIAACQNGATTVSATMFFASKLGIHVFVTGGIGGVHRHGETTMDISSDLTELGRTPVAVISAGVKSILDIPKTLEYLETQGVTVAAYRTNSFPAFFTNSSGCQVPCRLDTPEECARLINSNLKLGLGSGILIAVPIPKQHSASGNLIESAIQEALKEAKDKHITGSAATPFLLSRVNELTGGASLTANIALVKNNAMVGAKIAVALANLGQGIKNSHVKSAL